Correlation Structure and Fat Tails in Finance: a New Mechanism. 



Marco Airoldi 

Risk Management & Research, Intesa-Bci Bank 

Via Clenci 4, Milan, ITALY 

(February 1, 2008) 



O 
O 



O 
en 









a 
o 
o 



> 

m 

in 

o 



a 



X3 

o 
o 



X 



Fat tails in financial time series and increase of stocks cross- 
correlations in high volatility periods are puzzling facts that 
ask for new paradigms. Both points are of key importance in 
fundamental research as well as in Risk Management (where 
extreme losses play a key role). 

In this paper we present a new model for an ensemble of stocks 
that aims to encompass in a unitary picture both these fea- 
tures. Equities are modelled as quasi random walk variables, 
where the non-Brownian components of stocks movements are 
leaded by the market trend, according to typical trader strate- 
gies. 

Our model suggests that collective effects may play a very 
important role in the characterization of some significantly 
statistical properties of financial time series. 

FAGS numbers: 89.65. Gh, 89.75.-k, 05.40.-a, 01. 75. -Km 



I. INTRODUCTION 

One of the most important goals in the appHcation of 
Physics to Finance ||l|,g] and in Risk Management, is to 
model accurately financial time series. 
Indeed physicists have become more involved in the study 
of financial systems and financial time series, among all 
complex systems, due to the huge amount of available 
data in electronic format, a situation that makes easier 
to check paradigms and theories. 

On the other hand, practitioners working in finance need 
to understand financial time series in order to: (i) as- 
sess correctly the market risk associated to portfolios and 
(ii) make a more rational price evaluation of derivative 
products, a task where the knowledge of stochastic pro- 
cess followed by the underlying asset is a key element. 
(E.g. the largely accepted Black-Scholes formula for op- 
tion pricing [pi is based on the Gaussian hypothesis for 
asset returns. Moreover many attempts to justify smiles 
in implied volatility structures are based to alternative 
dynamics for price variations M.) 

The modelling of financial time series started at the be- 
ginning of the 20th century. In a pioneering work, Bache- 
lier showed that stock price fluctuations could be mod- 
elled as a random walk (i.e. a Brownian motion) [g. In- 
deed more accurate analyses of financial data have shown 
that geometric Brownian motion is inadequate to de- 
scribe correctly empirical distributions when large events 
occur |§-jll| . In order to account for fat tails in empirical 



data, Mandelbrot, in his 1963 paper, proposed to model 
financial time series through a levy stable process [^. 
More recently Mantegna and Stanley pointed out that a 
levy flight distribution is an accurate description only for 
the central part of the distribution of returns while for 
the extreme tails an exponential fltting is more appropri- 
ated; in particular they suggested that the distribution of 
price changes is consistent with a truncated Levy flight 
distribution Q. 

Other empirical investigations considered the time evolu- 
tion of volatility (i.e. the second moment of price returns 
distribution) ||Jl^,|l^,|l| . The analysis of high frequency 
data has shown that two different regimes take place: a 
super-diffusive behaviour for short time intervals (where 
volatility scales with time as a{t) ^ t^ , (3 ~ 0.8 > 1/2) 
and a regime close to diffusive behaviour {cr{t) ^ t^/^) 
for long times (see ||l[, par. 7.1). Clearly the standard 
geometric Brownian motion does indeed account only for 
a diffusive regime. 

The discussions made above concern the univariate case, 
where only a single financial time series is considered. 
The extension of the one-dimensional Brownian motion 
to a set of N assets is the so-called multivariate geo- 
metric Brownian motion. Today most of the analyses 
in Risk Management are based on this model, in spite 
of its limitations |]1^. Within the multivariate Brown- 
ian motion the cross-correlations between stocks are sup- 
posed to be constant and independent from market ac- 
tivity. Indeed empirical evidence has shown that correla- 
tions between stocks are strictly dependent from market 
volatility and increase sharply during turmoil |ll5|-pi|, a 
phenomenon called correlation breakdown. Moreover it 
has been demonstrated that a couple of correlated series, 
following a geometric Brownian motion, are asymptot- 
ically independent (i.e. extreme events occur indepen- 
dently) [p2 14|. This means that the covariate model is 
unable to describe the correlation structure of extreme re- 
turns. All these facts have serious consequences on Risk 
Management [g3|,^. In fact, if we use a covariate model 
to estimate the market risk (i.e. the risk due to unex- 
pected variations of market conditions) of a portfolio of 
stocks, we under-estimate the probability of simultane- 
ous losses and consequently under-estimate the possible 
portfolio claims. On the other hand, from a theoretical 
point of view, in recent years an increasing attention has 
been devoted to the study of ensembles of stocks traded 
in the same financial market p5[ , with an emphasis on 
correlation structure characterization |2l| . 

Starting from this context, we introduce a new model 



for a set of stocks that aims to encompass in a unitary 
picture some of the most important features of financial 
time series, that is: time evolution of volatility, fat tails 
and correlation structure. For this model we show that 
a super-diffusive process characterizes the time scaling 
of volatility for intermediate time intervals (with a criti- 
cal exponent 3/4) while a s tanda rd diffusive regi me ta kes 
place for long times (par. IIIB). In paragraph [II C we 
discus s how fat tails can emerge in the system. In para- 
graph HID a Monte Carlo simulation approach is used to 
investigate the correlation structure of the model. Finally 
Section IV is devoted to discuss briefly some conclusions. 



II. THE MODEL 

As discussed in the Introduction, one of the most in- 
teresting characteristics of financial markets is the in- 
crease of cross-correlations when large market events oc- 
cur. Empirical evidence shows that for large market re- 
turns almost 90% of the equities have the same sign as 
that of the market ||21| . In other words during market 
crisis or running days most of stocks follow the general 
trend. This suggests that the probability for an indi- 
vidual stock to have a positive (negative) return can be 
affected by a market increase (decrease). Basing on this 
consideration we introduce a one step model for an en- 
semble of N equities, {Si}'^^, where each stock move- 
ment, 5Si = ±s, follows a quasi random walk, with a hop- 
ping probability, PsSi , depending on the previous market 
return. Precisely: 



P,,<.(M(-^*)) = i + i 



1 SS, 
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Since for large N the normalized market return, M, is 
expected to be small compare to 1, it turns out that: 

(i) the function g, satisfying all the above constraints, 
can be well approximated as: 



g{M) = Af - 7M3 



0<7< 1 ; 



(3) 
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(1) 



(ii) g, in eq. (Q), is indeed a correction respect to the 
random walk contribution. 

Some considerations can be made on the model de- 
fined above. First of all we observe that the model (with 
the choice (0)) contains a symmetry between upward and 
downward movements. Actually, as pointed out in the lit- 
erature (see for instance |2^ ) , the increase of correlations 
for large market events is thought to be asymmetric, with 
a stronger interlink dependence between equities during 
crash periods. Indeed, it is very simple to incorporate 
this feature in our model by considering two different 
values of 7: a 7+ for upward moves and a lower value, 
7^, for downward moves. However in order to simplify 
the analysis we maintain 7+ — 7^. 

Secondly, from a heuristic point of view the model pro- 
posed has a strong analogy with a critical Ising model 
in infinite dimensions, it is therefore not surprising that 
critical (i.e. power law) behaviours can emerge. 
Finally, it is clear that the model we have introduced is 
quite schematic: we treat all the equities on the same 
foot and apart fr om co rrelations induced by the market 
growth (see par. HID), no explicit correlations are con- 
sidered. However we can expect that a more complete 
model would maintain the basic features we are going to 
discuss. 



where: 
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is the normalized market return. At is the time re- 
quired to complete one step and 17 is a function such that 
I 9{M) |< 1. (To simplify the formulas in the following 
we have chosen s = 1 and At = 1.) 

The factor 1/2, in equation (nl), represents the usual ran- 
dom walk contribution (i.e. a 50% chance to perforin 
a positive or a negative step). The function g incorpo- 
rates the typical trader strategy: when market blows up 
some traders (the bulls) assume that the market will con- 
tinue to rise (this behaviour is modelled by imposing the 

following condition on g: limM->o jm ~ ^^ ^^ ^^^ *-*?" 
posite hand other agents will consider this positive trend 
a good opportunity to realise a profit by selling their po- 
sitions (thus we require that | g{M) \<\ M \ for M ^ 0). 
Finally, a rising market reflects into an upward general 
tendency for the stocks belonging to that market (as in 
crash periods most of equities drop dramatically as the 
market does); hence g must have the same sign of M . 



III. RESULTS AND DISCUSSION 

In this section we present, relatively to our model, 
some results regarding fat tails and correlation struc- 
tures. We resort to analytical calculations (para- 



graphs III A and IIIB) as well as Monte Carlo simula- 
tions (par. HID ). 



A. Probability distribution of market returns 

The model discussed in the previous section, can be 
regarded as a system of N random walkers with a hopping 
probability depending on the previous mean hopping. 
If we indicate with P(j, t, M) the probability to find a 
generic walker at site j, after a time t, when the mean 
hopping at time t—1 is M , then the time evolution master 
equation for P is given by: 



P(j,t+1,M)= Y. G{M,M) 



M=-l 



i±^P(j - 1, t, M) + ^-^P(j + 1, t, M) 



(4) 



where: 



G{M,M)^ 



m 



(^N-^f)\(N-^]\ 



lAT-i 



,Af-i 



(5) 



and P{j, t, M) obeying to the initial condition: 
Pij,t = 0,M) = 6,^0- 5mm- 

Interestingly, G{M, M) can be regarded as a sort of tran- 
sition matrix from a state characterized by a market re- 
turn M to a state M. 

The probability distribution of market returns over 
a time horizon of one step is given by Po{t,M) = 
Y,-P{j,t,M). From eq. (|), it follows that: 



Po (i + 1 , M) = Yl <^(^^' ^'^) ^0 {t. M) 



(6) 



j\/=-i 



After a transient period, the solution of eq. (^) converges 
to a stationary state. In fig. 0, the solution obtained by 
solving numerically the eq. (0) is presented (solid line) 




Market return M 



FIG. 1. The distribution of market returns over a time 
horizon At (i.e. one step) for A*' — 200 and 7 = 0.2. Solid line: 
exact result; dotted line: analytical expression (|9|); points: 
Monte Carlo simulation. 

An analytical solution of eq. (0) can be worked out, 
in the limit of large N, by approximating the opera- 
tor f{f{M)) "^^ J2m G{M, M)f{M) through differential 
operators. Indeed by using the Stirling formula and con- 
sidering the Taylor expunction of / at the second order 
in SM ~ M — M, it is possible to show that: 



+ - 



Y, G{M,M)f{M) « /(A?) 
1 52 



M=-l 



dM 



^ [iM^im 



2N dAp 



(i-A72)/(M)] , (7) 



where the above approximation holds in the limit of large 
N and under the following restrictions on the generic 
function /: 

7Af 2 < 1 VAf : f{M) is significantly ^ , (8a) 



1 d^fjM) 
f{M) dAP 



<iV 



(8b) 



Setting f = Pq and substituting eq. (|^) in eq. (^, we 
obtain a second order differential equation for Pq; its an- 
alytical solution is given by: 



Po(Af) = C {l-M 



/y-2^7^-1 jNM^ 



(9) 



where C denotes a constant such that X]j\/^o(-^^) — 1- 
Interestingly, we observe that the above solution satisfies, 
for A^ -^ 00, both conditions (g). As a definitive check we 
compare, in figure ffl, the probability distribution of mar- 
ket returns obtained with different methodologies, that 
is: (i) by solving numerically the exact equation (||) ; (ii) 
by using the analytical expression (S) and (iii) by resort- 
ing to a Monte Carlo simulation. All the three curves 
are quite close (the analytical and the exact solutions 
are almost indistinguishable). 

It is interesting to compare the analytical solution (|) 
with the corresponding expression for a set of N inde- 
pendent equities. In such a case the market return is 
the sum of N uncorrelated variables assuming the values 
±s. By applying the central limit theorem, the probabil- 
ity distribution of market variations turns out to be the 
usual Gaussian function: 



Pr 



un-coiT. 



(A/) 



\/2^ 



(10) 



As one can realise immediately, the market volatility, 
(Ta/, computed from the above probability distribution, 
scales with N as N^2^ in contrast with our model where 
aM ~ N~ 4 . This means that the inter-dependence 
among equities has the effect to maintain the market 
volatility still very large even for large value of N. As a 
consequence the extreme market price movements in our 
model happen far more often than would be expected 
by chance. This characteristic is one of the statistical 
properties of real financial indexes pointed out in litera- 
ture [|l|. 



B. Power law in time scaling of volatility 

The volatility of a single stock over a horizon t, crs(i), 
(which, in our model, is indeed the same for all equities) 
is given by: a|(i) = E,,m f PU: t, M). 
By using the eq. (|^), it turns out that crs(i) satisfies the 
following finite-difference equation: 



a|(i + l)-a|(t) = l + 2. ^ 5(M).Pi(t,M), 

(11a) 



A/=-l 



where Pi {t, M) '^= J2j J P{h t, M) must obey to: 

1 
Pi (t + 1, M) = J2 '^(^^' ^^)^i (*' ^^) + ^^ -P'o(^^) ' 

(lib) 



M=-l 



with the initial condition Pi{t = 0, A/) = 0. 

Note that if we substitute to the matrix G{M, M) its 
approximated expression (H) and represent the finite dif- 
ference Pi{t + 1, M ) — Pi{t,M) by a time derivative, 
the equation ( |llb| ) reduces to a Fokker-Planck equa- 
tion with a drift —"/M^ and a diffusion coefficient "'"^j^^^ 
(the factor MPq can be always re-adsorbed by redefining 
Pi (t, M) — Pi {t, M) —p{M), for an appropriate function 
P)- 



Regarding the time evolution equation of volatility (11a), 
the first contribution represents the usual diffusive pro- 
cess while the second term is originated by the infiuence 
of market trend on stock returns. If this last term has 
a power law time dependence t^ {S > 0) then the stock 
volatility is characterized by a super-diffusive process. 
In order to investigate the time scaling behaviour of equa- 
tions (^, we set Pi{t, M) = h{t, M) ■ Pq{M). Substitut- 
ing @) into ( |llb| ), the equation for h reads: 



dh ^ r-i dh 1 , , ,9 

dt ' dM 2N ^ 



dAP 



M 



(12) 



Interestingly, we note that the analysis of the finite- 
difference version of eq. (|l2|) shows that Pi can be al- 
ways expressed as a finite sum of the form Pi{t,M) — 
Sfe=i '^k,t M'^^~^ Po(M). Now it is simple to prove that 
M" Po(M) fulfil the conditions (|) only for y/nj/N < 1 
and therefore we can expect that Pi satisfies the rela- 
tions (0) only in the range of times t such that y^t^/N ^ 
1. Hence the Partial Differential Equation (PDE) (|l|) 
can be regarded as a good approximation for the evolu- 
tion of h only for t ^ N/'^. As a result the time evolution 
of (7s{t) is characterized by three regimes: 



(i) 



< t < 



h{t,M) satisfies the PDE (12) 



JV 
7 " " " 7 ■ 

The long time scaling behaviour of h is investigated 
in Appendix ^. Precisely, it can be proved that if 

^ -^ \f^ ^^^^ ^M 5(^) -Pi(^. ^) ~ ^^ and there- 
fore crs{t) ~ ti. In such a case the system is char- 
acterized by a super-diffusive process. Note that 
the critical exponent 3/4 is quite close to the em- 
pirical result of 0.8 reported in literature (see p], 
par. 7.1). 



(u) t comparable to — . The PDE ( [T^ ) fails in describ- 
ing the time evolu tion of h. The numerical solution 
of equation (lib) shows that Pi{t,M) converges 
towards a stationary state (see fig. ^). As a conse- 
quence (Ts{t) '^ i2 and therefore the super-diffusive 
scaling behaviour breaks down; 

(iii) an intermediate regime between the two, where 
volatility scales faster than t^ but without a sharp 
power law behaviour. 




FIG. 2. Time derivative of variance. 



di 



versus time. 



for TV — 200 and 7 = 0.1. For large t (i.e. t comparable to A''), 
the curve converges to a constant value (standard diffusive 
regime). Indeed for A'' ~ 100 the range of scaling behaviour 
is very narrow. 

The behaviour reported above for the time evolution of 
volatility is indeed typical of financial time series (see [y , 
par. 7.1). 



C. Fat tails 

Using the above results we can have some insights on 
the presence of fat tails in the probability distribution of 
stock price returns. Indeed, we can expect that for N 
sufficiently large, most of the time the market growth, 
M, is relatively small (from eq. (^ limjs[^oaPo{M) = 0) 
and therefore the majority of stocks follow a geometric 
Brownian motion (P55. (Af) « 1/2). On the other hand 
during extreme market movements (crashes or running 
days), the walkers have a higher probability, respect to 
a simple random walk, to move far from the origin (ex- 
treme events). Therefore is reasonable to imagine that 
the central part of the stock price variations distribution 
is approximately near to a standard Gaussian distribu- 
tion while for the tails a fatter non-Gaus sian s hape is ex- 
pected. Indeed the results of subsection IIIB show that 
the stock volatility grows faster than the Brownian term 
y/t (at least until t is of order TV/7). This is coherent with 



the hypothesis of fat tails in the distribution of stock re- 
turns, which indeed provide an extra-contribution to the 
volatihty growth. Since the time evolution of volatil- 
ity reaches a standard diffusive behaviour for t compa- 
rable to -/V/7 (as indicated by the numerical simulation 
reported in fig. y) we can suppose that also fat tails would 
disappear on long time scales, reducing the process to a 
pure Gaussian stochastic random walk. This is in accor- 
dance with the empirical evidence observed in financial 
time series, where a crossover between a leptokurtic dis- 
tribution for short times and a Gaussian behaviour for 
long time intervals takes place B. 



D. Cross-correlations during turmoil 

As we have seen in the introduction, ensembles of 
stocks are characterized by an increase of correlations 
during critical periods. This behaviour is qualitatively 
well reproduced within our model. 
In figures ^ and ^, some Monte Carlo results are given. 




Curve C (t^lOO) 
un-corr. stocks 
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FIG. 3. Conditional probability for a stock to have the 
same sign of the market return for different values of the time 
horizon: t = 10; 100 (curve A and B). In the figure is also 
reported the probability distribution corresponding to a set of 
independent stocks over a time horizon t — 100 (curve C). The 
Monte Carlo results are obtained considering 200 samples, 
N = 200 and 7 = 0.2. On the x-axis the market returns (over 
a time t) are measured in unit of market standard deviation 
(over a horizon t). 

Specifically, in fig. 0, we present the conditional prob- 
ability for stocks to have the same sign as that of the 
market. The results show a strong sign dependence as 
a function of the market return. Interestingly, that de- 
pendence increases further considering larger time hori- 
zons (compare curve A and B in fig. ||). On the other 
hand, for a set of uncorrelated stocks, the sign condi- 
tional probability is substantially weaker (see fig. g) and 
remains stable increasing time. 
The above results are consistent with the empirical obser- 



vations reported by Bouchaud et al |2^. They consider 
a set of 450 U.S. equities, obtaining the same shape of 
fig. ^, with a 90% of the stocks have the same sign as that 
of the market return for very large market movements. 
Figure H clearly shows that correlations between stocks 
are enforced during high- volatility periods. Indeed in a 
one step model for an ensemble of N stocks, it is always 
true that the mean stocks cross-correlation, p, is related 
to market variance, a^ = (M^) — (M)^ (over a horizon 
of one step), through the relation: p w af.j — ■^. How- 
ever in our model the spreading of market volatility is 
enormously accentuated respect to the uncorrelated case 
(see fig. ^ for a comparison). As a consequence, in our 
model, we have effectively periods characterized by high 
volatility and strong cross-correlations alternated to nor- 
mal market activities. 




Collective model 



■ Un-correlated stocks 



Market Variance 

FIG. 4. Conditional correlation between stocks as a func- 
tion of market variance afj. As the market becomes more 
volatile the stocks become more interlinked. The results in 
the figure are obtained using a Monte Carlo with 200 samples, 
an observation interval of 100 steps, N = 200 and 7 — 0.2. 



IV. SUMMARY AND CONCLUSIONS 

In this paper we have proposed a new model that 
aims to capture some of most puzzling aspects of sta- 
tistical properties of financial time series. Specifically, 
we have focalised our attention on two open problems: 
(i) leptokurtic behaviours in the probability distribution 
of stock returns and (ii) inter-dependence between mar- 
ket volatility and correlation structure when an ensemble 
of equities is considered. Both aspects cannot be taken 
into account by the standard approach commonly used 
in finance to describe correlated risk factors (that is the 
covariate Gaussian model). 

We have introduced a new model that encompasses in 
a coherent picture both points (i) and (ii). The mecha- 
nism we have proposed is based on the interplay between 
equities movements and market trend. 

The results obtained suggest two considerations: (i) fat 
tails and correlation structure of extreme returns could 



be indeed strictly connected; (ii) an ensemble of stocks 
can be regarded as a "critical system" , where the collec- 
tive inter-dependences among equities play a key role. 

Our analysis may be of interest, from a theoretical 
point of view, in clarifying statistical properties of en- 
sembles of stocks and, from an applied perspective, in 
improving portfolio market risk estimations. 



APPENDIX A: 

This short appendix is devoted to determine the long 
time scaling behaviour of the solution of PDE ( [l2|) , sat- 
isfying the initial condition: h{M, t = 0) = 0. 
For 1/A^ = a close solution of equation ( [l2|) can be 
worked out: 



h'^°\M,t) 



2 M 



7 \M 



t + 



1 



2-fA'P V 27M2 



1 



(Al) 



On the other hand, although for finite N an exact solu- 
tion cannot be found, it is still possible to investigate the 
behaviour of h in the long time regime. This is done by 
considering a perturbative solution of equation (Oh re- 
spect to parameter ^: h{M,t) = Y.^ {j^Y h'-^HM,t). 
It is a matter of algebra to show that each function 

h'-^^ contains only terms of the form [t + 1/ (2^M^)'\ ^ , 
< fc < 2j; this suggests to group together equal terms 
and look for a solution of the form: 



h{M,t) 



t + 



M 



Ea-w 



27M2 



fe=0 

7-fe 



27M2 



,-k- 



(A2) 



It turns out that each function fk , which indeed depends 
on iV, satisfies the recursive equation: 



„ ,, M3 dfk (ffk 
^ 1 - M2 dM dAf 2 



7i\f4 



Ik-1 



2 dfk-i 



7M3 dM 



1 



/fe-2 







^2 '7 72M6 
supplemented with the global condition: 

fe=0 ^ ^ 



(A3) 



(A4) 



It is possible to demonstrate the following statements 
about {fk}- 

(i) /o must be a constant; 



(ii) for any given M and fc > 1 : limjv_,oo fk{M) = 0; 

(iii) for any given M: limjv^oo /2fe+i(M)//2fe(M) = 0; 

(iv) each fk is a regular function apart from a diver- 
gence in M = 0: fk ^ M-^'Uk > 1). (Hence for 
M ^ aU the terms in eq. (|A^ ) behave like 1/M.) 

Basing on the above properties, it is straightforward to 
argue that the long time scaling behaviour of the solu- 
tion of PDE ( p^ ) is given by the dominant term /q (i.e. 
the term corresponding to iV = c» apart from a multi- 
plicative coefficient), even for small values of M. In the 
language of Renormalization Group (RG), we say that 
the term in eq. (12) proportional to -^ is irrelevant. (An 
overview of the application of RG ideas to PDEs is given 
in H.) 

To summarise, for large times compare to }.i , the so- 
lution of PDE dH) behaves like h^°'>{M,t). Hence for 
t ^ — p^7 we have that h{M,t) ^ t^ and a power law 
behaviour emerges in the model. 
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